arXiv:1503.03459vl [math.OC] 11 Mar 2015 


Robust generation of elementary flux modes 


Hildur iEsa Oddsdottir Erika Hagrot^, Veronique Chotteau 

and Anders Forsgren^ 

^Department of Mathematics, Optimization and Systems Theory, 
KTH Royal Institute of Technology, SE-100 44 Stockholm, Sweden 
^Division of Industrial Biotechnology/Bioprocess Design, KTH 
Royal Institute of Technology, Albanova Center, SE-106 91 
Stockholm, Sweden 

Received: March 12, 2015/ 


Abstract 

Elementary flux modes (EFMs) are vectors defined from a metabolic 
reaction network, giving the connections between substrates and prod¬ 
ucts. EFMs-based metabolic flux analysis (MFA) estimates the flux over 
each EFM from external flux measurements through least-squares data 
fitting. In previous work we presented an optimization method of column 
generation type that facilitates EFMs-based MFA when the metabolic re¬ 
action network is so large that enumerating all EFMs is prohibitive. In 
this work we extend this model by including errors on measurements in 
a robust optimization framework. In the robust optimization problem, 
the least-squares data fitting is minimized subject to the error on each 
metabolite being as unfavourable as it can be, within a given interval. In 
general, inclusion of robustness may make the optimization problem sig¬ 
nificantly harder. However, we show that in our case the robust problem 
can be stated as a convex quadratic programming problem, i.e., of the 
same form as the original non-robust problem. Additionally, we demon¬ 
strate that the column-generation technique of the non-robust problem 
can be extended also to the robust problem. Furthermore, the option to 
indicate intervals on metabolites that are not measured is introduced in 
this column generation framework. The effect of including robustness in 
the model is evaluated in a case-study, which indicated that the solutions 
of our non-robust problems are in fact near-optimal also when robustness 
is considered. On the other hand, the addition of intervals on unmeasured 
metabolites resulted in a change of optimal solution. Implying that the in¬ 
clusion of intervals on unmeasured metabolites is more important than the 
direct consideration of measurement errors, this despite up to 20% errors. 
Keywords: Metabolic Network; Robust Optimization; Least-squares; 
Elementary Flux Mode; Chinese Hamster Ovary Cell 
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1 Introduction 


Q In previous work we presented a column generation based a lgorithm for solv¬ 
ing t he EFMs-based metabolic flux analysis (MFA) problem (lOddsdottir et ai 
2014il . In this work we present a more refined model where the column gener¬ 
ation algorithm is combined with robustness. For the sake of completeness a 
short description of the background foll ows. A more detailed background can 


be found in e.g., lOddsdottir et all (1201411 . A metabolic reaction network is rep¬ 
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resented by the stoichiometric matrix A, which together with the flux vector 
(v) gives the overall change in concentration of each metabolite (C). The rows 
of the stoichiometric matrix (A) refer to either external metabolites (A^) or 
internal (Ai). The flux space is given by a set of vectors v that satisfy the 
pseudo-steady state assumption and flow direction assumption, 


( 1 ) 


where Ij is a reduced identity matrix with ones only when j G Jirrev and Jirrev 
is the set of irreversible reactions. When all reactions in the network are ir¬ 
reversible o is a cone where an y ray can be written as a non -negative linear 
combination of the extreme rays ( Nemhauser and Wolsev . 19991 Part 1.4 Theo¬ 
rem 4.8). 

FFMs contain information how extracellular metabolites are c onnected by _ 

detai ling which reactions are required for their uptake or production (iLlaneras and Pico 
I 2 OIOII . They are vectors in the flux space, each FFM includes only a minimial 
set of reactions and is nondecomposable ( Klamt and Stellin3 . 2002[l . Further, 
any vector in the flux space can be denoted as a non-n egative linear combination 
of the FFMs ( Schilling et all . 19991 : Papin et al . 2003 1. 


V = wici = Ew, 7 > 0, 
1=1 


( 2 ) 


where e denotes a single FFM and the matrix E contains the FFMs as columns. 
In this sense the FFMs generate the flux space and are related to the def¬ 
inition of extreme rays in the cone © with only irreversible reactions. In 
fact when a metabolic network only has irre versible reactions the FFM s and 
the extreme rays of the cone ([T|) are equal ( Gagneur and Klamt . l2004ll . We 
assume, without loss of generality, that the metabolic network has only irre¬ 
versible reactions, i.e., Vj > 0 Vj. When the network includes reversible reac¬ 
tions finding all the FFMs is equivalent to finding all the e xtreme rays of a cone 
in an extended space where all rea ctions are irreversible (jGagneur and KlamtI . 
2004 lUrbanczik and Wagner , 20051) . For modest-sized networks enumeration of 
FFMs is possible and computer programs exist for that purpose, e.g., Metatool 
( von Kamo and Schiister , 120061) . Howe ver, with increased netw ork size enu¬ 
meration of FFMs becomes prohibitive ( Klamt and Stellingl. 2002 ). Thu s focus 


has shifted t o identify only a subset of the FFMs (Ide Figueiredo et al 
Kaleta et al . 20091 : Tabe-Bordbar and Marashi . 2013 ). 


20091: 


Abbreviations: MFA, metabolic flux analysis; EFMs, elementary flux modes; CHO, Chi¬ 
nese hamster ovary; Lac, lactate; Glc, glucose 
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This work co nsiders the solu tion of the EFMs-based metabolic flux analysis 
(MFA) problem ( Provost . 20061 Chapter 5.2) when the network is large and 
there are known bounds on measurement errors. FFMs-based MFA uses the 
decomposition of v given by m to create a macroscopic network (AxE). The 
macroscopic fluxes (w) are then adjusted so that the flux in the network fit the 
cell specific external flux measurements (Q), i.e., 


minimize —\\Q — IAxEw\\ 


subject to 


2 ' 
w > 0. 


(3) 


The formulation given by © includes multiple repetitions of the same exper¬ 
iments, i.e., if Qk are results from one repetition, k, then = [g^,...qj], 
where d denotes the number of repetitions. X is a stacked identity matrix con¬ 
sisting of d identity matrices of size M^xt (number of external metabolites) or 
I = [iMext > • ■ ■ ) where /Mext is repeated d times. 

FFMs-based MFA as given by ([3]) requires the whole set of FFMs, limit¬ 
ing the application to simplified networks. Methods that can solve the FFMs- 
based MFA problem without enumerating FFMs exist. One method identi- 
fies FFMs beforehan d through a series of linear programming (LP) problems 
( .Tunerers et aj 2011 1. This method is based on the existence of a feasible flux 
vector V, an assumption we will examine in Section [5] In our previous work 
we introduced a more integrated approach that enables ident ification of FFMs 
in co njunction with solving the FFMs-based MFA problem (jOddsdottir et all . 
20141) . The approach was based on an opti mization technique named column 
generation ( Liibbecke and Desrosierj . 200^, in which large networks can be 
handled by relying on a master problem and a subproblem that are solved iter¬ 
atively. The subproblem gives the master problem a new column every iteration 
until the solution of the subproblem indicates that the solution of the master 
problem is optimal to the full optimization problem. 

The experimental measurements used to calculate the fluxes in Q in the 
FFMs-based MFA problem (151) are pro ne to errors, which have been stated to 
reach at least 20% ( Goudar et aj 20091) . For this reason we wanted to consider 
the sensitivity of the solution with respect to these errors. Additionally, in some 
cases certain metabolites included in the network, are difficult to measure and 
thus remain unmeasured in the data set. Fven though those metabolites are 
unmeasured in this specific experimental setup some information on their fluxes 
can be available, and a bound can be added. We there fore present an extensio n 
to our previous column generation algorithm given bv lOddsdottir et aJ ( 20141) . 
This extension includes both a robust formulation and a version that deals with 
unmeasured metabolites, while still having the benefit of working with larger 
networks. In the robust formulation the error on each measurement is assumed 
bounded, while unmeasured metabolites are given a feasible interval. 

In the robust formulation the aim is to minimize the objective function when 
the assumed errors are such that the objective is as disad vantageous as it can 
be . For more informati on on robust optimization please see Mulvev et all ( 1995 ) 


or 


Ben-Tal et al (200^. Previous work on robust least-squares mainly focus on 


errors in both the measurements and the model, in general those f ormulations 
are difficult to solve (NP complete) ( FI Ghaoui and Lebreti Il997l) . However, 
we show that for this special case, where the errors are only in measurements 
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and bounded by an interval, the robust problem can be formulated as a convex 
quadratic programming (QP) problem. Furthermore, column generation can 
be applied to this QP, allowing the problem to be solved without previous 
enumeration of EFMs. 

The paper is outlined as follows. In Section [2] it is shown how the stacked 
least-squares can be written as least-squares of averages along with an example, 
showing that metabolic reaction networks do not necessarily have a feasible 
flux vector for a given set of external measurements. Then we present the 
main results of this paper in Section [31 a robust version of the EFMs-based 
MFA, where column generation can also be applied, along with a version in 
which intervals for unmeasured metabolites are included. Finally in Section [4] 
we present some results comparing the solutions of the robust problem to the 
EFM-based MFA. 


2 On the Feasibility of the EFMs-based MFA 

In this section we examine the uniqueness of the stacked EFMs-based MFA 
and if there always exists a flux vector that fits the network and measurements 
exactly. These observations support our main results shown in Section [31 To 
simplify the discussion we consider a problem equivalent to the EFMs-based 
MFA where a flux vector v is sought, 

... 1 2 
minimize — IIIAjjU — QIU 

V 2 

subject to AiV = 0, 
u > 0. 


Problem is equivalent to by using the decomposition of v given by 
thus removing the equality constraint. With the stacking of multiple mea¬ 
surements the objective function of seems to represent an overdetermined 
problem. However, problem (|3]) can be represented as if it only has one mea¬ 
surement, or as an underdetermined problem by, 

d d d 

\\IA^v - QWl = ^ \\A^v - QkWl = dv'^AlAa;V -2'^qlA^v + ^qlqk. 


Thus, the solution of dH) is equal to the solution of 


minimize 

V 

subject to 



AiV = 0 , 


V > 0. 


(5) 


Consequently, for a given experimental condition, stacking repetitions is equal 
to using the average value of the flux measurements. 

In light of that the data fitting can equivalently use the average, ie., only 
one measurement, it becomes important to consider if there always exists a 
solution to ® with zero residual. That is, if 

: A^xV = <li AiV = 0, u > 0, for any q. (6) 
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For robustness the existence of a solution is especially relevant, because when 
there is only one measurement that fits the network exactly robust optimization 
will not give a different solution from the non-robust solution. Although, it 
should be noted that when there are repetitions, or multiple measurements, the 
solution of the robust optimization can differ from the non-robust solution. 

Previous analysis of calculability in networks have considered when there 
exists a unique v that satisfies (O without the positivity constraint. Hence 
examining if a network is underdetermined or determined. In general a full 
rank matrix has the whole of TZ as its range, indicating that there always exists 
a V such that A^v = q and AjV = 0. When the netw ork is underdetermined 
this V would not be unique ( Klamt and Schnsteil . 1200^ . However, this assumes 
that V can be negative in all values. In metabolic networks reactions are often 
restricted to only one direction. Hence, an underdetermined network may not 
have a solution for all sets of measurements. A small example of how this can 
happen follows. 



Figure 1: A reaction network with underdetermined stoichiometry. 

Consider the network shown in Figure [TJ If one external metabolite is not 
measured then the network has underdetermined stoichiometry, and thus, there 
exists a v such that A^v = q and AiV = 0 for any q. However depending on 
which external metabolite is not measured v might not be positive. 

• If is not measured, then, depending on what the measurements are, 
there might not exist a f > 0 that satisfies the stoichiometry. With no 
measurements on vi is free, however V 2 ,vs and vg are fixed from 
measurements. Flow balance requires that vg vg, A vg, additionally if 
V 2 < Vg then the flow through C 4 cannot be fulfilled. Thus, if vg is too 
high flow balance can not be fulfilled and no feasible v exists. 

• If C 2 is not measured, then vg is free, and can be chosen so that the 
flow to C 7 and Cg is satisfied, note that any lack of flow from vi can be 
compensated by sending through vg. 

Hence, errors in measurements can lead to nonexistance of a flux vector for the 
given network that fits the measurements exactly. 
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3 The Robust Variant of the EFMs-based MFA 


This section contains the main results of this work, here we present an extension 
of the EFMs-based MFA problem, where errors in Q are taken more directly 
into consider ation. For this purpose we make u s e of a technique named robust 
optimization ( Mulvev et al 1995 : Ben-Tal et all . 200911 . 

The robust optimization problem is to minimize the residual when the errors 
in the data give a worst-case scenario outcome, i.e., the errors in the data are 
such that the residual is maximized. Inherent in least-squares is the assumption 
that the errors are bounded by the two-norm, i.e., || AQ|| < /3. In fact, when the 
errors are assumed bounded by the two norm, the least squares problem gives the 
same solution as its robust variant. However, in this work we assume that the 
errors in Q are bounded by an interval, a more restrictive assumption that might 
cause the solution to change. The interval is such that Qreai = Q + AQ where 
AQi = [Aqf,... AqJ]'^ and \Aqki\ < Oki \qki\, k refers to a specific repetition 
and i to the metabolite. In order to simplify notation 9 is stacked in the same 
way as Q and AQ, the subindex s then refers to a specific element in those 
vectors. Note that in general the percentage of error on each metabolite is the 
same for all repetitions, i.e., Ok^i = 9k2i for all ki and k 2 . The robust problem 
is then given by 


1 


minimize maximize 
w>0 |AQq<e,|Q,| 


\\IA,Ew -Q + AQII. 


(7) 


As shown in Appendix El problem © can equivalently be formulated as a 
quadratic programming problem in the form 


minimize 

10,t 


\\lAj;EW — I 




subject to tg — {lA^Ew — Q)g9sQs>Q, Vs 
tg + (IA,gEw - Q)^ OgQs > 0, Vs 
w > Q. 


( 8 ) 


When 0 = 0 the above formulation is equivalent to the EFMs-based MFA ([3]) . 

To make the notation more compact we define 0 and Q as diagonal matrices 
with 9 and Q on the diagonal, respectively. Further, the objective function of 
([S]) can be stated as minimizing the average measure of q over all measurements. 


minimize 

W,t 

subject to 


1 

Ag^Ew - - ^ 


Qk 


fe=l 




t - eq {TAggEw - Q) > 0, 
t -I- QQ {lAxEw — Q) >0, 
w > 0, 


(9) 


The formulation in ([^ shows that even when the average value gives a zero norm 
solution of the EFMs-based MFA, the robust solution might be different. The 
reason for this difference can be seen when the constraints in Q are examined. 
For multiple measurements of the same metabolites {lA^Ew — Q)^ for each 
specific measurement will in general not be equal to zero for all s, forcing t to 
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increase from zero. With enough increase in t the robust solution might deviate 
from the non-robust solution, i.e., increasing \\TAxEw — Q\\ while decreasing 
t. Thereby, giving a non-zero value of \\IAxEw — Q|| the robust solution. The 
change in the solution depends on two factors, how far the measurement is from 
the best least-squares calculated flux and how high the error on that measure¬ 
ment is assumed to be. No change in the optimal solution is expected when 
either the measurements are good or the assumed interval is tight, since then t 
can remain close to zero. 

3.1 Column Generation of the Robust Variant of EFMs- 
based MFA 

For large networks enumerating all EFMs beforehand is prohibitive. For that 
reason, we present two problems: A master problem and subproblem that can 
be solved iteratively to identify the necessary EFMs along with solving problem 
([5]) . Their derivation can be seen in appendix!^ The master problem is given 

by 


minimize 

w,t 

i WIA^Ebwb - Qf + 

(10a) 

subject to 

t - QQ {IA^Ebwb - Q) > 0, 

(10b) 


t QQ {IAxEbwb - Q) > 0, 

(10c) 


Wb > 0, 

(lOd) 


where the index B indicates that only the known columns of E are used. The 
corresponding subproblem requires information from the solution of the master 
problem. More specifically the macroscopic fluxes, wb and the dual solutions. 
Am and Xp corresponding to the constraints (llObI) and (llOcI) respectively, are 
required. The subproblem is given by 


minimize (^A^EeWb — Q + QQXm — OQ\j ^A^e 

subject to AiC = 0, 

l'^e= 1, 

Cj > 0 Vj. 


( 11 ) 


The subproblem (fTTl) identifies EFMs ( Oddsdottir et all 2014ll until the objec¬ 
tive function value is non-negative. At that stage the optimal solution of the 
master problem is also the optimal solution of the full problem. 


3.2 Inclusion of Intervals on Unmeasured Metabolites in 
the Robust Variant 

In this section a further extension of the EFMs-based MFA is introduced, where 
unmeasured metabolites are taken into consideration. Unmeasured metabolites 
are external metabolites that are a part of the network used but have no mea¬ 
surement data. An example is the metabolite CO 2 a gas that is difficult to 
measure without special experimental setup. Intervals are estimated on those 
metabolites and modelled with a penalty function. In this way the intervals 
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are allowed to be infeasible for the first few iterations of the column generation. 
A robust optimization problem that considers feasible intervals on unmeasured 
metabolites can be stated as, 

minimize (MJ max(0, (—+ Ax^nEw)) 

+ Mf max(0, - Ax,nEw))+ ( 22 ) 

maximize - \\XAxEw — O + AQII | , 

|AQ|fc<efc|Qfc| 2 J 

where Ax^n are the rows from the stoichiometric matrix that correspond to the 
unmeasured metabolites, and Ql^ are the upper and lower bounds on the 
given interval respectively. The quantities and Mi indicate how large the 
penalty is for not satisfying the specific interval constraint. In general Mu and 
Ml will be set to a sufficiently large number by the user. The inner maximization 
problem is unchanged from © and hence, (II3 can be represented as a convex 
quadratic programming problem, 


minimize 

i WlAxEw-Qf + l^t + M^z^ + Mfz^ 

(13a) 

subject to 

t - QQ {lAxEw - Q) > 0, 

(13b) 


t + QQ {XAxEw — Q) > 0, 

(13c) 


2 ;“ - Ax,nEw > -Q“, 

(13d) 


+ Ax^nEw > Qu, 

(13e) 


> 0, 

(13f) 


IV 

0 

(13g) 


ru > 0. 

(13h) 


The formulation from m can be solved using column generation where the 
subproblem generates columns of E by, 

minimize (jIAxEbWb — Q + QQXm — QQXpf^IAx 
A/) Ax^n) 6 

subject to AiC = 0, 
l^e < 1, 
ej > 0 Vj, 

where A„ and A/ are the dual variables corresponding to constraints (jl3dl) and 
(I13el) . 


4 Case-Study: Cultivation of CHO Cells 


4.1 Particulars of the Data 


Data w ere obtained from the same experimental setup as described in lOddsdottir et al 
( 20141) . A Chinese hamster ovary (CHO) cell line producing a monoclonal an¬ 
tibody (mAb) was cultivated during 11 days according to a pseudo-perfusion 








protocol (daily saniple collection and medium exchange) to imitate steady-state 
conditions. The cultivation was carried out in parallel cultures using different 
medium compositions. Cell-specific metabolic rates (external fiuxes) were cal¬ 
culated for the last seven days of culture. Two different media were selected for 
the present work in order to show extreme situations of our findings, the result¬ 
ing fiuxes are presented in Tables [1] (Medium 1) and [5] (Medium 5). In addition 
to the measured data, an interv al on COq flux was esti mated as 4.95 — 7.09, 
based on the intervals given by ICoudar et all (I 2 OIIII and lAunins and Henzler 
(l2nnRll . 


Metabolite 

Q6,l 

Q 7 ,l 

98,1 

99.1 

910,1 

911,1 

912,1 

Ala 

0.45 

0.51 

0.44 

0.40 

0.40 

0.41 

0.46 

Arg 

-0.27 

-0.26 

-0.14 

-0.19 

-0.11 

-0.47 

-0.22 

Asn 

-0.17 

-0.22 

-0.17 

-0.18 

-0.20 

-0.18 

-0.15 

Asp 

0.04 

0.07 

0.07 

0.06 

0.06 

0.07 

0.07 

Biomass 

0.61 

0.60 

0.50 

0.70 

0.53 

0.56 

0.65 

Cys 

-0.09 

-0.11 

-0.07 

-0.09 

-0.09 

-0.07 

-0.05 

Glucose (Glc) 

-3.52 

-4.06 

-2.64 

-3.26 

-3.96 

-2.92 

-3.43 

Gin 

-1.60 

-1.97 

-1.61 

-2.38 

-2.31 

-1.90 

-1.71 

Glu 

0.22 

0.32 

0.25 

0.27 

0.30 

0.33 

0.28 

Gly 

0.04 

0.07 

0.05 

0.03 

0.03 

0.05 

0.03 

His 

-0.02 

-0.05 

-0.02 

-0.01 

-0.01 

-0.01 

-0.02 

He 

-0.10 

-0.13 

-0.10 

-0.10 

-0.11 

-0.12 

-0.11 

Lactate (Lacj 

5.48 

7.40 

5.89 

6.20 

6.78 

7.02 

6.00 

Leu 

-0.19 

-0.22 

-0.17 

-0.17 

-0.18 

-0.20 

-0.19 

Lys 

-0.05 

-0.05 

-0.06 

-0.05 

-0.08 

-0.07 

-0.04 

Met 

-0.05 

-0.07 

-0.04 

-0.05 

-0.06 

-0.04 

-0.03 

NH+ 

1.17 

- 

1.17 

1.15 

1.23 

1.24 

1.18 

Phe 

-0.10 

-0.12 

-0.10 

-0.09 

-0.09 

-0.12 

-0.12 

Pro 

-0.10 

-0.14 

-0.09 

-0.11 

-0.11 

-0.13 

-0.10 

Ser 

-0.00 

0.01 

-0.01 

0.00 

0.00 

-0.03 

0.01 

Thr 

-0.11 

-0.10 

-0.11 

-0.10 

-0.12 

-0.10 

-0.07 

Trp 

-0.03 

-0.07 

-0.02 

-0.03 

-0.03 

-0.02 

-0.02 

Tyr 

-0.09 

-0.12 

-0.08 

-0.08 

-0.08 

-0.12 

-0.10 

Val 

-0.15 

-0.17 

-0.13 

-0.14 

-0.15 

-0.16 

-0.13 

niAb 

2.1e-04 

2.3e-04 

1.8e-04 

1.8e-04 

1.7e-04 

2.2e-04 

1.9e-04 


Table 1: External fiuxes obtained from a CHO cell cultivation, given for each 
metabolite from the final seven days of the cultivation for medium 1. The unit 
is pmol • cell~^ • day“^, except for Biomass which has the unit day“^. 


4.1.1 Particulars of the Error on the Data 


The errors on the measurements (AQ) are assumed bounded by an error param¬ 
eter 9s that varies for each metabolite but remains constant between repetitions, 
i.e., lAQIj, < 9s IQIs- The estimation of the error parameter was mostly based 
on the estimated errors of experimental measurements, along with the evalu- 
ated variance in th e data set. Finally, consistency with the analysis given by 
Goudar et all ( 2009l l was ensured. The values of 9 for each metabolite are given 
in Table [3] 


4.2 Description of the Metabolic Network 

The network used in this study is based on a network available in the literature 
( Zamorano River^ 20121 Section 2.2). The network was extended in several 
ways to better fit this study. More reactions were made reversible and some 
transport reactions were added. The final network consists of 101 reactions. 
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Metabolite 

96,5 

97,5 

98,5 

99,5 

910,5 

911,5 

912,5 

Ala 

0.43 

0.45 

0.46 

0.50 

0.44 

0.40 

0.43 

Arg 

- 0.45 

- 0.57 

-0.22 

-0.17 

-0.11 

-0.23 

-0.51 

Asn 

-0.18 

-0.22 

-0.20 

-0.22 

-0.21 

-0.19 

-0.17 

Asp 

0.07 

0.08 

0.08 

0.08 

0.09 

0.09 

0.08 

Biomass 

1.11 

0.57 

0.55 

0.59 

0.52 

0.55 

0.56 

Cys 

-0.11 

-0.14 

-0.12 

-0.13 

-0.11 

-0.09 

-0.12 

Glc 

-3.56 

-3.22 

-3.14 

- 2.79 

-3.23 

-3.18 

-3.19 

Gin 

-1.79 

-1.71 

-1.60 

-2.41 

-1.81 

-1.85 

-1.77 

Glu 

0.25 

0.31 

0.22 

0.24 

0.22 

0.23 

0.21 

Gly 

-0.00 

-0.06 

0.01 

0.04 

0.03 

0.03 

0.04 

His 

-0.03 

-0.03 

-0.04 

-0.03 

-0.03 

-0.03 

-0.03 

He 

-0.13 

-0.20 

-0.11 

-0.11 

-0.11 

-0.11 

-0.13 

Lac 

6.70 

6.02 

5.41 

6.24 

5.61 

5.82 

5.84 

Leu 

-0.21 

-0.30 

-0.19 

-0.19 

-0.18 

-0.19 

-0.22 

Lys 

-0.09 

-0.15 

-0.04 

-0.04 

-0.06 

-0.09 

-0.07 

Met 

-0.05 

-0.07 

-0.06 

-0.05 

-0.06 

-0.04 

-0.04 

NH+ 

1.37 

1.45 

1.26 

1.33 

1.31 

1.11 

1.25 

Phe 

-0.12 

-0.11 

-0.10 

-0.11 

-0.09 

-0.07 

-0.13 

Pro 

-0.17 

-0.23 

-0.12 

-0.11 

-0.12 

-0.12 

-0.15 

Ser 

-0.09 

-0.06 

-0.01 

0.03 

0.00 

0.01 

0.01 

Thr 

-0.14 

-0.23 

-0.13 

-0.11 

-0.13 

-0.11 

-0.10 

Trp 

-0.03 

-0.03 

-0.03 

-0.03 

-0.03 

-0.02 

-0.02 

Tyr 

-0.12 

-0.12 

-0.10 

-0.11 

-0.08 

-0.07 

-0.06 

Val 

-0.18 

-0.24 

-0.15 

-0.15 

-0.15 

-0.15 

-0.18 

mAb 

2.3e-04 

2.3e-04 

1.8e-04 

2.2e-04 

1.6e-04 

2.0e-04 

2.1e-04 


Table 2: External fluxes obtained from a CHO cell cultivation, given for each 
metabolite from the final seven days of the cultivation for medium 5. The unit 
is pmol • cell~^ • day“^, except for Biomass which has the unit day“^. 


whereof 29 are reversible, and 100 metabolites, whereof 28 are external. Metabo¬ 
lites that are included in the network but not measured are CO 2 , Choline, and 
Ethanolamine. 

In some experiments an external metabolite concentration is set to zero in the 
medium. This does not exclude the optimal solution from using that metabolite 
in the optimal solution. Hence, in order to get a solution that fits better with 
the experimental set-up, columns of corresponding to reactions from those 
metabolites are removed, thus blocking the optimal EFMs from using those 
reactions. For media 1 and 5 those metabolites are mAb. 


4.3 Technicalities on Normalization 


The results are presented based on a normalized version of the EFMs-based 
MFA. The normalized version aims at fitting the network with the measure¬ 
ments divided by the average value for each specific metabolite in the medium 
considered. The network is normed similarly by dividing each row of with 
the average of the measurement for the corresponding metabolite in the medium 
considered. Thus if the average value is defined as. 


<^^,g = 


E 


k^l 


Qi,k,g 

d 


then the external network {A^) and measurements {Q) are redefined as follows, 


Gij 


Qi,k,g 
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Metabolite 

Error {0 [%]) 

Ala 

13.04 

Arg 

17.25 

Asn 

20.36 

Asp 

13.72 

Biomass 

17.42 

Cys 

17.61 

Glc 

14.73 

Gin 

15.39 

Glu 

13.73 

Gly 

15.47 

His 

17.10 

He 

15.31 

Lac 

17.52 

Leu 

15.49 

Lys 

14.55 

Met 

13.78 

NH+ 

13.96 

Phe 

16.05 

Pro 

15.29 

Ser 

15.94 

Thr 

15.71 

Trp 

15.01 

Tyr 

13.58 

Val 

23.05 

mAb 

18.57 


Table 3: The percentage error on each 
metabolite, 6i (%) 


Where, is an element from and Jext represents the set of all measured 
external metabolites in the network. When < 0.02 the value is replaced 
with \qi^g \ = 0.02, in order to avoid dividing by too small values. This minimum 
is chosen to affect only a few metabolites. For metabolite 1 this affects mAb 
and Ser, for metabolite 5 this affects those same metabolites along with Gly. 

4.4 Results and Discussion 

In this section the results for two experimental conditions using two different 
media are given. In order to demonstrate the difference of the EFMs-based 
MFA with and without robustness the flux over each EFM and the flux to each 
external metabolite for three levels of error are shown. The levels of error are 0%, 
5%, and 100% of the 9 error given in Table[31 Additionally, the effects of adding 
an interval are examined by considering the results with a given interval on CO 2 
for 6 equal either to zero or 100%. The 0% error interval is equivalent to the 
EFMs-based MFA without robustness. The solution with 100 % of error interval 
is referred to as the robust solution. The results are shown in the following tables 
and figures, where Tables|l]and|n]show the flux over each EFM for medium 1 and 
5 respectively. Furthermore, the value of the objective functions for the EFMs- 
based MFA with and without robustness is shown. Figures [2] and [3l show in 
the same manner, the flux over each EFM where the flux has been normed 
with respect to the flux given by the robust solution, this gives an overview of 
how different the fluxes are for each error interval. Tables [S] and [7] are similarly 
constructed but show the flux to each external metabolite. 

Considering the results for medium one shown in Table 0] it can be noted that 
introducing robustness has no impact on which EFMs are required. Further, 
the change in the norm and robust norm is low, indicating that the impact of 
robustness on the solution is low. When the flux to each metabolite shown in 
Table 0 is considered, there are some minor adjustments in the solution when 
robustness is added, however the larges change comes when an interval on CO 2 
is enforced, resulting in a lower flow to CO 2 . 


II 




EFM 

Macroscopic Reaction 

Wq 

^ 0.05 

W\ 

yjQ,inv 

1 

0.5 Glu ^ 0.5 Ala + 1 CO 2 

5.71 

5.84 

5.90 

4.36 

2 

0.5 Glc ^ 1 Lac 

4.96 

4.86 

4.81 

4.45 

3 

1 Asn 1 Lac 

1.58 

1.58 

1.58 

1.58 

4 

0.5 Glc + 0.5 Asn + 0.5 Ala ^ 1 Ser + 0.5 Glu 
+ 1 GO 2 

1.58 

1.72 

1.77 

0.25 

5 

1 Gin + 1 Asp ^ 1 Asn + 1 Glu 

1.55 

1.52 

1.58 

1.39 

6 

1 Ser 1 Gly 

1.47 

1.67 

1.64 

0.33 

7 

1 Ala + 1 GO 2 1 Asp 

0.88 

0.95 

0.98 

0.35 

8 

1 Asp + 1 Gly ^ 1 Asn + 1 GO 2 

0.59 

0.67 

0.61 

0.09 

9 

0.16667 Tyr + 0.83333 Ala ^ 1 Asp 

0.57 

0.58 

0.61 

0.57 

10 

1 Lac + 1 Gly ^ 1 Ala + 1 CO 2 

0.54 

0.57 

0.66 

0.08 

11 

0.16667 Leu + 0.16667 Lac + 0.83333 Ala ^ 1 
Asp 

0.50 

0.49 

0.46 

0.38 

12 

1 Gin ^ 1 Glu + 1 NH+ 

0.34 

0.34 

0.28 

0.49 

13 

0.05195 Glc + 0.0656 Gin + 0.0278 Ser + 0.046 
Arg + 0.0552 Thr + 0.0552 Lys + 0.0644 Val + 
0.046 He + 0.2596 Leu + 0.0644 Phe + 0.0184 
Met + 0.0736 Lac + 0.0744 Gly + 0.2032 Glu + 
0.0184 Cys + 0.0184 His + 0.046 Pro + 0.0092 
Trp + 0.006 Ethanolamine + 0.0171 Choline ^ 
0.084 Asp + 0.218 CO 2 + 1 Biomass 

0.31 

0.31 

0.34 

0.43 

14 

1 Ala ^ 1 Lac + 1 NH^ 

0.29 

0.26 

0.34 

0.47 

15 

1 Ser ^ 1 Lac + 1 NH^ 

0.27 

0.24 

0.32 

0.07 

16 

1 Gly ^ 1 NH+ + 1 CO 2 

0.23 

0.29 

0.21 

0.08 

17 

0.5 Val ^ 0.5 Ala + 1 CO 2 

0.22 

0.22 

0.22 

0.22 

18 

0.5 Glc + 1 Arg ^ 1 Ser + 1 Glu 

0.21 

0.23 

0.23 

0.21 

19 

0.03155 Glc + 0.0756 Gin + 0.0368 Asn + 0.046 
Arg + 0.1982 Thr + 0.0552 Lys + 0.0644 Val + 
0.046 He + 0.0828 Leu + 0.0644 Phe + 0.0184 
Met + 0.3718 Lac + 0.1452 Ala + 0.0164 Glu + 
0.0184 Cys + 0.0184 His + 0.046 Pro + 0.0092 
Trp + 0.006 Ethanolamine + 0.0171 Choline ^ 
0.3726 CO 2 + 1 Biomass 

0.17 

0.17 

0.18 

0.06 

20 

0.5 Asp + 0.25 Phe =;■ 0.25 Ala + 0.5 Glu + 1 
CO 2 

0.14 

0.13 

0.17 

0.16 

21 

0.16667 Lys + 0.33333 Lac + 0.66667 Ala 1 

Asp 

0.14 

0.14 

0.14 

0.14 

22 

1 Ala + 0.2 Trp ^ 0.8 Asp + 0.4 Glu 

0.12 

0.11 

0.11 

0.12 

23 

0.03155 Glc + 0.0912 Gin + 0.0686 Ser + 0.0368 
Asn + 0.046 Arg + 0.0552 Thr + 0.0552 Lys + 
0.0644 Val + 0.046 He + 0.0828 Leu + 0.3296 
Phe + 0.0184 Met + 0.0736 Ala + 0.0744 Gly 
+ 0.0008 Glu + 0.0184 Gys + 0.0184 His + 0.046 
Pro + 0.0092 Trp + 0.006 Ethanolamine + 0.0171 
Choline ^ 0.2092 Asp + 0.3064 CO 2 + 1 Biomass 

0.11 

0.12 

0.08 

0.10 

24 

1 Pro 1 Glu 

S.52e-2 

8.19e-2 

8.15e-2 

8.52e-2 

25 

1 He =;. 1 Glu + 1 CO 

2 

8.25e-2 

7.99e-2 

7.99e-2 

8.25e-2 

26 

1 Cys 1 Lac + 1 NH^ 

5.68e-2 

4.13e-2 

2.29e-2 

7.20e-2 

27 

1 Gly + 1 Cys =;- 1 Asn 

5.23e-2 

7.53e-2 

9.46e-2 

3.71e-2 

28 

1 Thr =;. 0.5 Gly + 0.5 Glu + 0.5 CO 2 

4.43e-2 

4.39e-2 

4.19e-2 

5.92e-2 

29 

1 Ser + 1 Met ^ 1 Asp + 1 Cys 

3.78e-2 

3.85e-2 

3.92e-2 

3.78e-2 

30 

1 Asp + 1 His ^ 1 Asn + 1 Glu 

0.56e-2 

0.61e-2 

0.93e-2 

0.56e-2 

31 

1 His ^ 1 Glu + 1 NH+ 

0.56e-2 

0.65e-2 

0.33e-2 

0.56e-2 

32 

AAs 1 mAb 

0.02e-2 

0.02e-2 

0.02e-2 

0.02e-2 

33 

0.5 Glc ^ 1 CO 2 

- 

- 

- 

1.84 

Norm 


AxEw ^ X)fe=i Qk 


0 

0.22 

0.42 

0 

Rob. N. 


AxEw - ^ Y:i=i <lh 

2 _ 

+ 

463.38 

444.79 

442.41 

463.80 


Table 4: Fluxes of each EFM for medium 1, solved with varied assumed error. 
Wo, wo. 05 , and wi indicate the results with 0 at 0, 5, and 100 of the values given 
in Table[3]% respectively. wo,inv and wi^mv indicate the results with an interval 


given for CO 2 with 6* at 0 and 100 % respectively, tg = Qe {XA^Ew — Q) 

where 6 is given by Table [3] The unit of the fluxes is pmol • celP^ ■ day~^ 
except for Biomass for which the unit is day“^. AAs denotes amino acids. 


4.23 

4.20 

1.58 

0.13 

1.36 

0.27 

0.33 

0.04 

0.61 

0.06 

0.30 

0.50 

0.48 


0.52 

0.05 

0.05 

0.22 

0.23 

0.04 


0.17 

0.14 

0.11 

0.07 


8.15e-2 

7.99e-2 

5.79e-2 

5.97e-2 

5.22e-2 

3.92e-2 

0.89e-2 

0.37e-2 

0.02e-2 

2.24 

0.42 

442.41 
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Figure 2: Flux over each EFM, normed with respect to the full error robust 
solution, for medium 1. 


Metabolite 

90 

90.05 

91 

qo,inv 

qi,inv 


Ala 

0.44 

0.44 

0.44 

0.44 

0.44 

Table 5: Fluxes of 

Arg 

-0.24 

-0.26 

-0.26 

-0.24 

-0.26 

1 1 1 r 

Asn 

-0.18 

-0.18 

-0.18 

-0.18 

-0.18 

tJcidi iiitjLciljuliLfc^ iui 

Asp 

0.06 

0.07 

0.07 

0.06 

0.07 

medium 1 , solved with 

Biomass 

0.59 

0.60 

0.60 

0.59 

0.60 

varied assumed error. 

CO 2 

8.39 

8.76 

8.83 

6.98 

7.07 

Choline 

-0.01 

-0.01 

-0.01 

-0.01 

-0.01 

Wo, Wo. 05 , and wi in- 

Cys 

-0.08 

-0.09 

-0.09 

-0.08 

-0.09 

dicate the results with 

Bthanolamine 

-36e-4 

-36e-4 

-36e-4 

-36e-4 

-36e-4 

Glc 

-3.40 

-3.43 

-3.43 

-3.40 

-3.43 

9 at 0, 5, and fOO 

Gin 

-1.93 

-1.91 

-1.90 

-1.93 

-1.90 

% of its given value 

Glu 

0.28 

0.28 

0.28 

0.28 

0.28 

Gly 

0.04 

0.05 

0.05 

0.04 

0.05 

in Table |3] respec- 

His 

-0.02 

-0.02 

-0.02 

-0.02 

-0.02 

lively. wo^inv and 

He 

-0.11 

-0.11 

-0.11 

-0.11 

-0.11 

Lac 

6.40 

6.20 

6.20 

6.40 

6.20 

wi inv indicate the re- 

Leu 

-0.19 

-0.19 

-0.19 

-0.19 

-0.19 

suits with an interval 

Lys 

-0.06 

-0.06 

-0.06 

-0.06 

-0.06 

Met 

-0.05 

-0.05 

-0.05 

-0.05 

-0.05 

given for CO 2 with 0 

NH4 

1.19 

1.18 

1.18 

1.19 

1.18 

at 0 and 100 % re- 

Phe 

-0.10 

-0.10 

-0.10 

-0.10 

-0.10 

Pro 

-0.11 

-0.11 

-0.11 

-0.11 

-0.11 

spectively. The unit 

Ser 

-0.00 

-0.00 

-0.01 

-0.00 

-0.01 

of the fluxes is pmol • 
cell“^ ■ day“^, except 

Thr 

-0.10 

-0.10 

-0.10 

-0.10 

-0.10 

Trp 

-0.03 

-0.03 

-0.03 

-0.03 

-0.03 

Tyr 

-0.10 

-0.10 

-0.10 

-0.10 

-0.10 

for Biomass for which 
the unit is day~^. 

Val 

-0.15 

-0.15 

-0.15 

-0.15 

-0.15 

mAh 

2e-4 

2e-4 

2e-4 

2e-4 

2e-4 

Considering the results from medium five shown 

in Table H] it can be noted 


that for all levels of robustness one more EFM is required in the solution in 
order to reach optimality. When the flux to CO 2 is bounded a few EFMs are 
dropped and new ones are introduced. In general introducing a new metabo¬ 
lite in the model by giving an interval of its value, i.e., without experimental 
measurements, has a large impact on the solution than introducing robustness. 
Furthermore the intervals can be satisfied without a large change in the norm 
of the data fitting. This indicates that giving realistic intervals on unmeasured 
metabolites can help guide the solution. These results show that, here, larger 
improvements of the residual error are brought by improving the model struc¬ 
ture, i.e. introducing new metabolites, compared to taking into account the 
robust solution, which address the parameter estimation. 
























































Macroscopic R.eacti( 


^0 


"0.05 


0.5 Glc 1 Lac 

0.5 Glu ^ 0.5 Ala + 1 CO 2 

0.5 Glc + 1 Ala =>■ 1 Ser + 1 Lac 

0.5 Gin + 0.5 Lac ^ 1 Asp 

1 Ser 1 Gly 

1 Asp + 1 Gly ^ 1 Asn + 1 CO 2 
1 Asn 1 Lac 

0.5 Glu ^ 0.5 Lac + 0.5 NH+ + 1 CO 2 

0.5 Asn + 1 Lac 0.5 Ala + 0.5 Glu + 1 CO 2 

1 Ser ^ 1 Ala 

1 Gin 4- 1 Asp 1 Asn + 1 Glu 
0.5 Glc + 1 Arg ^ 1 Ser + 1 Glu 
0.25 Tyr => 0.25 Glu + 1 CO 2 
1 Asn ^ 1 Asp + 1 NH^ 

0.25465 Glc + 0.0656 Gin + 0.0468 Asn + 0.0616 Asp + 0.046 Arg 
+ 0.0552 Thr + 0.0552 Lys + 0.0644 Val + 0.046 He + 0.5446 Leu + 
0.0644 Phe + 0.0184 Met + 0.0736 Ala + 0.0744 Gly + 0.0184 Cys + 
0.0184 His + 0.046 Pro + 0.0092 Trp + 0.006 Ethanolamine + 0.0171 
Choline =>■ 0.3668 Glu + 0.9804 CO 2 + 1 Biomass 

0.027502 Glc + 0.063938 Gin + 0.066862 Ser + 0.035867 Asn + 
0.044834 Arg + 0.053801 Thr + 0.053801 Lys + 0.32125 Val + 
0.044834 He + 0.080702 Leu + 0.32125 Phe + 0.017934 Met + 
0.072515 Gly + 0.025731 Glu + 0.017934 Cys + 0.017934 His + 
0.044834 Pro + 0.0089669 Trp + 0.005848 Ethanolamine + 0.016667 
Choline 0.024951 Lac + 0.36569 Ala + 1 CO^ + 0.97466 Biomass 
+ 


3.20 

2.64 

2.62 

2.05 

1.94 

1.89 

1.60 

1.43 

1.24 

1.01 

0.54 

0.36 

0.34 

0.32 

0.30 


3.82 

2.72 

1.93 

2.27 

2.15 

2.09 

1.70 

1.62 

1.32 

0.22 

0.35 

0.46 

0.26 

0.20 

0.27 


2.30 

3.13 

3.44 

2.20 

2.33 

2.26 

1.74 

1.09 

1.11 

1.60 

0.37 

0.49 

0.28 

0.46 

0.23 


1 Gin ^ 1 Glu + 1 NH^ 

0.52941 Val + 0.17647 L-'eu + 0.058824 Pro 
Glu + 1 CO 2 
1 Thr 1 Asp 

0.33333 Asn + 0.33333 Asp + 0.33333 Lys + 0.33333 P 
Arg + 0.66667 Glu + 1 CO 2 
1 He + 1 Cys 1 Asp + 1 Glu 

0.16667 Glc + 0.33333 Asn + 0.33333 Met + 0.33333 P 


0.058824 Arg + 0.52941 


0.25 

0.12 


0.10 

0.10 


0.09 

0.08 


0.27 

0.22 


0.10 

0.12 


0.08 

0.05 


0.28 

0.22 


0.10 

0.12 


0.07 

0.05 


2.69 

2.53 

3.22 

1.49 

2.45 

1.12 

1.60 

1.13 

3e-7 

1.07 

0.73 

0.35 

0.26 

0.38 

2e-5 


0.32 

0.23 


0.10 

0.09 


0.09 

0.06 


2.02 

2.48 

3.72 

1.62 

2.32 

1.29 

1.69 

1.27 

1.89 

0.69 

0.50 

0.29 

0.42 


0.23 

0.20 


0.10 

0.12 


0.08 

0.05 


23 

0.25 Ala + 0.25 Trp => 

0.5 Glu + 1 CO 2 

0.07 

0.07 

0.06 

0.06 

0.08 

24 


Pro ^ 1 Glu 


0.05 

0.05 

0.05 

0.05 

0.05 

25 

0.028217 Glc + 0.0756 Gin + 0.0686 Ser + 0.0368 Asn + 0.0616 Arg 
+ 0.2044 Tyr + 0.0552 Thr + 0.0552 Lys + 0.0644 Val + 0.2228 He 
+ 0.0828 Leu + 0.0368 Phe + 0.0184 Met + 0.0744 Gly + 0.0008 
Glu + 0.0184 Cys + 0.0184 His + 0.046 Pro + 0.0092 Trp + 0.006 
Ethanolamine + 0.0171 Choline 0.224 Ala + 0.4956 CO 2 + 1 

Biomass 

0.04 

0.15 

0.17 

0.07 

0.05 

26 

0.028217 Glc + 0.0656 Gin + 0.5834 Ser + 0.046 Arg + 0.0276 Tyr 
+ 0.0552 Thr + 0.3204 Lys + 0.0644 Val + 0.046 He + 0.0828 Leu + 
0.0368 Phe + 0.5332 Met + 0.0744 Gly + 0.0264 Glu + 0.0184 His + 
0.046 Pro + 0.0092 Trp + 0.006 Ethanolamine + 0.0171 Choline ^ 
0.3484 Ala + 0.468 NH^ + 0.9736 CO^ + 0.4964 Cys + 1 Biomass 

0.03 

0.05 

0.06 

0.04 

0.05 

27 

1 Cys 1 Ala 


0.03 

0.06 

0.07 

0.04 

0.06 

28 

0.024997 Glc + 0.058115 Gin + 0.060773 Ser + 0.032601 Asn + 
0.06343 Asp + 0.040751 Arg + 0.024451 Tyr + 0.048901 Thr + 
0.048901 Lys + 0.057052 Val + 0.040751 He + 0.073352 Leu + 
0.032601 Phe + 0.0163 Met + 0.065911 Gly + 0.038979 Cys + 0.0163 
His + 0.050319 Pro + 0.25 TTp + 0.0053154 Ethanolamine + 0.015149 
Choline => 0.022679 Lac + 0.17665 Ala + 1 CO 2 + 0.8859 Biomass 

0.02 

0.03 

0.04 

0.03 

0.02 

29 


Lac 4- 1 His 1 Ala + 1 Glu 

99e-4 

118e-4 

175e-4 

lOOe-4 

151e-4 

30 


His ^ 1 Glu + 1 NH 

+ 

4 

99e-4 

92e-4 

35e-4 

99e-4 

59e-4 

31 

AAs ^ 1 mAb 


2e-4 

2e-4 

2e-4 

2e-4 

2e-4 

32 

0.33333 Asp + 0.33333 

Phe 0.66667 Glu + 1 CO 2 

- 

0.18 

0.18 

0.21 

0.24 

33 


Lac + 1 Gly => 1 Ala + 1 CO 2 

- 

- 

- 

1.28 

0.95 

34 

0.028217 Glc + 0.0656 Gin + 0.0686 Ser + 0.0368 Asn + 0.0716 Asp 
+ 0.046 Arg + 0.0276 Tyr + 0.0552 Thr + 0.0552 Lys + 0.0644 Val 
+ 0.046 He + 0.343 Leu + 0.0368 Phe + 0.0184 Met + 0.2346 Lac + 
0.0736 Ala + 0.0744 Gly + 0.0184 Cys + 0.0184 His + 0.046 Pro + 
0.0092 Trp + 0.006 Ethanolamine + 0.0171 Choline ^ 0.2238 Glu + 
0.0368 CO 2 + 1 Biomass 




0.45 

0.42 



Aa:B-m - ^ Efc = i Ik 

i2 

0.00 

0.69 

2.27 

0.01 

2.27 



AxBw - ^ Efc = i Qk 

12 T” 

+ 1 tg 

1151.23 

1073.57 

941.05 

1154.05 

945.09 


Table 6: Fluxes of each EFM for medium 5, solved with varied assumed error. 
Wo, Worn, and wi indicate the results with 9 at 0, 5, and 100 of the values given 
in Table[3]% respectively. wo,inv and wi^mv indicate the results with an interval 


given for CO 2 with 0 at 0 and 100 % respectively, tg = Qe {XA^Ew — Q) 

where 9 is given by Table [3] The unit of the fluxes is pmol • celP^ • day~^ 
except for Biomass for which the unit is day~^. AAs denotes amino acids. 
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EFM 


Figure 3: Flux over each EFM, normed with respect to the full error robust 
solution, for medium 5. 


Metabolite 

90 

90.05 

91 

qo,inv 

qi,inv 


Ala 

0.44 

0.44 

0.44 

0.45 

0.44 

Table 7: Fluxes of 

Arg 

-0.32 

-0.42 

-0.45 

-0.32 

-0.45 

1 1 1 Ti r 

Asn 

-0.20 

-0.20 

-0.20 

-0.20 

-0.20 

tJcidi iiitjLciljuliLfc^ lui 

Asp 

0.08 

0.08 

0.08 

0.08 

0.08 

medium 5, solved with 
varied assumed error. 

Biomass 

0.64 

0.57 

0.57 

0.64 

0.57 

co^ 

8.47 

9.11 

8.95 

7.09 

7.09 

Choline 

-0.01 

-0.01 

-0.01 

-0.01 

-0.01 

yjo, Wo. 05 , and wi in¬ 
dicate the results with 

Cys 

-0.12 

-0.12 

-0.12 

-0.12 

-0.12 

Bthanolamine 

-38e-4 

-34e-4 

-34e-4 

-38e-4 

-34e-4 

Glc 

-3.19 

-3.19 

-3.19 

-3.16 

-3.14 

9 at 0, 5, and fOO 

Gin 

-1.85 

-1.79 

-1.79 

-1.84 

-1.77 

% of its given value 

Glu 

0.24 

0.23 

0.23 

0.24 

0.24 

Gly 

0.01 

0.02 

0.03 

0.01 

0.03 

in Table |3l respec¬ 
tively. wo^inv and 

His 

-0.03 

-0.03 

-0.03 

-0.03 

-0.03 

He 

-0.13 

-0.13 

-0.13 

-0.13 

-0.11 

Lac 

5.95 

5.84 

5.84 

6.00 

6.24 

wi^inv indicate the re¬ 
sults with an interval 

Leu 

-0.21 

-0.21 

-0.19 

-0.21 

-0.19 

Lys 

-0.07 

-0.09 

-0.09 

-0.07 

-0.09 

Met 

-0.05 

-0.05 

-0.05 

-0.05 

-0.05 

given for CO 2 with 0 
at 0 and 100 % re- 

NH+ 

1.30 

1.31 

1.31 

1.30 

1.31 

Phe 

-0.11 

-0.11 

-0.11 

-0.11 

-0.11 

Pro 

-0.14 

-0.15 

-0.15 

-0.14 

-0.15 

spectively. The unit 

Ser 

-0.01 

-0.02 

-0.06 

-0.01 

-0.06 

of the fluxes is pmol • 
cell“^ ■ day“^, except 

Thr 

-0.13 

-0.13 

-0.13 

-0.13 

-0.13 

Trp 

-0.03 

-0.03 

-0.03 

-0.03 

-0.03 

Tyr 

-0.09 

-0.10 

-0.11 

-0.09 

-0.10 

for Biomass for which 
the unit is day~^. 

Val 

-0.17 

-0.17 

-0.18 

-0.17 

-0.15 

mAh 

2e-4 

2e-4 

2e-4 

2e-4 

2e-4 


5 Conclusion 


In this work we have examined the effect of errors in measurement on the solu¬ 
tion of the EFMs-based MFA. The approach has been to derive a robust form of 
the EFMs-based MFA, a form that considers measurement errors more directly, 
in the sense that each value based on measurements is given an error interval 
and the aim is to minimize the maximum possible error on the given interval 
with respect to the least-squares measure. These types of robust optim ization 
problems are in general not easily solved (|E1 Ghaoui and Lebretl . ll997T) . How¬ 
ever, for this special case we showed that the robust form can be stated as a 
convex quadratic optimization problem where column generation can be applied 
to achieve an optimal solution. Furthermore, it was demonstrated that an exter¬ 
nal metabolite could be taken into account in the model when no measurements 
exists this metabolite, by considering an interval of its value. 

By considering the worst-case scenario, we see how unfavourable the fit could 
be within the given error intervals. The case-study compares the solution of the 
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non-robust and robust EFMs-based MFA. Those results demonstrate that the 
optimal solution to the robust problem is similar to the optimal solution to the 
non-robust formulation, i.e., the FFMs-based MFA is rather robust to those 
assumed errors. This indicates that the errors on measurement do not induce a 
large change in the solution. 

Measurement errors were known to be around 20%, therefore it was impor¬ 
tant to consider the effects of those errors, especially with respect to which 
FFMs are used in the optimal solution. Our second contribution, the addition 
of intervals, is especially relevant for metabolites for which known intervals are 
available in the literature but are problematic to measure in the experimental 
setup. In fact, the addition of intervals had a larger effect on the solution than 
the addition of around 20% errors in measurements in the presented cases. The 
robust solution addressed the parameter estimation, while adding a metabolite 
by considering its interval, addressed the model structure. We showed a way 
to include the knowledge of unmeasured metabolites in the column generation 
method. This important result allowed to achieve improvements of the model 
structure. This approach can be a general strategy to improve a model struc¬ 
ture by introducing a new metabolite in a model. It can also be an approach to 
identify which variables should be measured when designing a new experiment. 
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A Derivation of the Robust Quadratic Program 

The robust formulation given by dZD can be represented as a quadratic program. 
To show this, consider first a reformulation of the two-norm problem, 

minimize maximize — WXAtEw — Q + AQII 
U'>0 |AQd<e„|Qd 2 


=mmimize 


- maximize (XA^Ew — Q + AQ)? , 
9 ^i lAni^oioi T'/i' 


>0 2 ^ |AQd<e,|Q,| 

1 " 

=min - max iXA^Ew — Q)i + 2 {XA^Ew — Q) AQs + AQI 

w>0 2 \AQs\<es\Qs\ 

S — 1 


=niin 

iu>o 2 


in 1 

- iXA^Ew - Q)l -I- - max 2 {XAa;Ew - Q)^ AQs + AQ^. 

^ Z |AQs|<@s|Qs| 


(15) 


The above derivation uses that the bound on AQ is elementwise, thus the maxi¬ 
mization can be moved inside the sum. Considering now the inner maximization. 
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• if {lA^Ew — Q)s < 0 then a AQ* that maximizes the norm is as negative 

as possible, i.e., = —0s|(5s|, 

• if {ZAxEw — Q)s A 0 then a AQ* that maximizes the norm is as positive 
as possible, i.e., AQ* = IQsl- 

Thus, the maximum is given by AQ* = sg'D.{{IAxEw — (5)s)0s|Qs|, 

maximize 2 [lA^Ew — AQs + AQ^ 

|AQ,|<e,|Q,| 

=2 (IA,^Ew - Q)^ sgn{{IAr,Ew - Q)s)ds |Qs| 

+ {sgn{{IAa:Ew - Q)s)0s IQsl)^, 

=2| {IA,,Ew - Q)^ 9sQs\ + {OsQs?- 

Then, we can set 4 = \{IAxEw — 0sQs| rewriting (ITHll to 

1 " 1 
minimize - \\XA,^Ew - Q||^ + + -{d^Q)'^ 

w,t z z 

fc=l 

subject to ts > {XAa;Ew — Q)^ 0sQs Vs, (16) 

ts > - (lA^Ew - Q)^ OsQs Vs, 
w >0. 

The constant [O'^QY can be disregarded from the objective function of ((19 and 
thus obtaining ((8])- 


B Deriving the Master- and Subproblem 

In this section we will derive the master- and subproblem for the column gener¬ 
ation of ((9 ■ For the master problem, consider when the columns of E are split 
into the known and unknown columns Eb and E^ respectively, the variable 
vector w is divided in the same way, wb and wn- Furthermore, wn is fixed to 
zero, since E^- represents unknown EFMs. This gives the problem to, 

2 

- q) > 0, 

; (17) 

- Qj > 0, 

Wb > 0 , 

Wn = 0 . 


minimize — 

w.t 2 


Wb 

Wn 


XAx [Eb En] 

subject to t — QQ (^A^ [Eb En] 
t + QQ (xA, [Eb En] 


-Q 

Wb 

Wn 

Wb 

Wn 


The formulation given by (1171) is equivalent to the master problem stated in 
(II9, by inserting wn = 0 into the objective function and constraints in (II9, 
can be obtained. 

The subproblem comes from comparing the optimality conditions of (ED and 
(HD. Because (|HD is convex, any point that satisfies the optimality conditions of 
dHD is an optimal solution. Thus, the optimality conditions of ((9 and ED can 
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be compared, identifying when a point that satisfies the optimality conditions of 
(HZl) also satisfies the optimality conditions for (|5]). The optimality conditions 
of ([5]) are given as, 


w >0, 
A >0, 

A„i >0, 


Ap >0, 

X^w = 0 , 


A^(f - QQIA.Ew + QQQ) =0, 


Aj(f + QQIA^Ew - QQQ) =0, 


'E^All^{IA^Ew - Q) 


'A 


'-E'^All'^QQ 

1 


0 


I 


'E^A^J^QQ 

I 


Ap=0. 


(18a) 

(18b) 

(18c) 

(18d) 

(18e) 

(18f) 

(18g) 


(18h) 


Note that the last condition Am + Ap = 1, together with complementarity, 
and that the two constraints are mutually exclusive, require that exactly one 
constraint is active, which is consistent with what we expect the problem to do. 
Consider now the optimality conditions of (II3, 


wn = 0 wb >0, 

(19a) 

An free Ab >0, 

(19b) 

IV 

o 

(19c) 

o' 

Al 

(19d) 

A^wb =0, 

(19e) 

A^(f - QQIA.Ebwb + QQQ) =0, 

(19f) 

Aj(t + QQIA.Ebwb - QQQ) =0, 

(19g) 

1 —0: 

(19h) 


'ElAlt^{XA,EBWB - Q) 


Ab 

1 

'ElAli^QQ 

ElAlX'^iXA^EBWB - Q) 


An_ 

“T 

ElAlt^QQ_ 


'ElAll'^QQ 

_eIaIttqq 


Ap —0. 


(19i) 


The differences of these two optimality conditions come from Aat being free, 
thus, a A can be negative contradicting (jl8bl) . Hence, the subproblem should 
identify if there is a An < 0, this can by done by an optimization problem where 
(119111 gives an objective function. The subproblem can then be stated as to, 

min A^I^(IAxEbWb — Q + QQAm — QQAp). 

e6£iv 


The requirement that e G can be fulfilled by 

{e : AiC = 0, l^e < 1, ej > 0 Vj}, 


and by ensuring that the solution is an extreme point solution. Together these 
constraints and objective function define an optimizati on problem that identifi es 
new EFMs that can be added to the master problem ( Oddsdottir et all . 12014 1. 
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